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(54) Vascular specific regulatory elements contained in the desmin 5' flanking region 

(57) The present invention concerns a DNA 
sequence isolated from the 5' flanking region of a 
desmin gene located between nucleotides -10000 to - 
1000 relative to the transcription initiation site, which 
contains regulatory elements specific to arterial smooth 
muscle cells. It also provides an expression cassette 
using such a DNA sequence to control expression of a 
gene of interest. Finally, the invention relates to a 
recombinant vector, a viral particle, an eukaryotic host 
cell, a pharmaceutical composition comprising said 
expression cassette and their therapeutic or prophylac- 
tic use as well as a transgenic animal having incorpo- 
rated in its genome said expression cassette. 
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Description 

[0001] The present invention concerns a DNA sequence isolated from the 5* flanking region of a desmin gene 
located between nucleotides -10000 to -1000 relative to the transcription initiation site, which contains regulatory ele- 

5 ments specific to arterial smooth muscle cells. It also provides an expression cassette using such a DNA sequence to 
control expression of a gene of interest. Finally, the invention relates to a recombinant vector, a viral particle, an eukary- 
otic host cell, a pharmaceutical composition comprising said expression cassette and their therapeutic or prophylactic 
use as well as a transgenic animal having incorporated in its genome said expression cassette. The present invention 
relates to the field of tissue-specific gene expression and has numerous applications particularly in human diseases. It 

10 is of very special interest in relation to prospect for gene therapy, especially in the cardiovascular field. 

[0002] Gene therapy can be defined as the transfer of genetic material into a cell or an organism to treat or prevent 
a genetic or acquired disease. The possibility of treating human diseases by gene therapy has changed in a few years 
from the stage of theorical considerations to that of clinical applications. The first protocol applied to man was initiated 
in the USA in 1 990 on a patient who was genetically immunodef icient as a result of a mutation affecting the gene encod- 

15 ing adenine deaminase (ADA). The relative success of this first experiment encouraged the development of this tech- 
nology for various genetic and acquired diseases. The large majority of the current protocols employ vectors to carry 
the therapeutic gene to the cells to be treated. Numerous viral or synthetic vectors have been developed during these 
last years. Generally speaking, their structure, organization and biology are described in the literature available to a per- 
son skilled in the art. 

20 [0003] However, under certain circumstances, the broad host range of these vectors can represent a major limita- 
tion for their use. Their lack of specificity could lead to a widespread expression of therapeutic genes which might be 
harmful to the patient especially when cytotoxic genes are transferred. On the other hand, the efficiency of the treat- 
ment could also be reduced by the overall dilution of the vector in the organism. Several strategies have been proposed 
to restrict expression to the category of cells to be treated. The use of tissue-specific regulatory elements represent an 

25 attractive approach to overcome this problem. 

[0004] Cardiovascular diseases represent a major target for gene therapy approaches since they rare a leadings 
cause of mortality in most of the industrialized countries. The general concept is based on the delivery of therapeutic^ 
genes locally or in the systemic circulation. A promoter capable to target expression in vascular smooth muscle cells ^ 
might be beneficial to avoid the potential side effects inherent to a widespread gene expression. The existence of mus- 

30 cle eel! specific promoters has been described in the literature. However, expression is rarely restricted to a particular 
muscle type. Thus, there is still a need for specific regulatory elements that are functional in vivo and are specific of 
subtypes of smooth muscle cells to allow treatment of cardiovascular pathologies. 

[0005] TTie transcriptional regulation of tissue-specific genes is usually dependant of the interaction between trans- 
acting transcription factors and c/s-acting sequences present in the promoter, some of them being tissue-specific. A 

35 first family of myogenic trans -acting factors (MyoD) responsible for skeletal muscle expression was characterized at the 
end of the eighties (MyoD, myogenin, myf-5, MRF4...). Their binding to a c/s-acting element called E box (consensus 
CANNTG) promotes transcriptional activation of sequences controled by promoters including such a regulatory ele- 
ment. In addition, other transcription factors have also been involved in activation of muscle-specific genes, such as 
CArG box binding factor (or SRF), MCAT binding protein, and MEF-2 factors.... 

40 [0006] The desmin gene encodes a cytoskeletal protein constitutive of intermediate filaments which occur in the 
cytoplasm of most muscular cells. Desmin expression is initiated at a very early stage of embryogenesis in the myo- 
genic lineage and maintained in adult stage in all muscular type cells (cardiac, skeletal and smooth muscle cells). The 
desmin gene and its promoter region were cloned and sequenced for several species (Li et al., 1989, Gene 78, 243- 
254 reporting the human gene sequence. Quax et al., 1984, Proc Natl Acad Sci USA 81, 5970-5974 et Quax et al., 

45 1985, Cell 43, 327-338 for the hamster desmin gene; Li et al., 1996, Dev Biol. 175, 362-366 for the mouse gene). 
[0007] Previous studies have revealed a complex regulatory mechanisms to promote the temporal and tissue-spe- 
cific expression of the desmin gene during myogenesis. Positive and negative control regions were identified in the first 
1000 bp of the 5' flanking region of the human desmin gene. In particular, a muscle-specific enhancer was localised 
between nucleotides (nt) -973 to -693 (relative to the transcriptional initiation site or cap site representing +1). Its capa- 

so bility to activate either homologous (desmin) or heterologous promoters in a manner independant of orientation, posi- 
tion or distance was demonstrated by in vitro transfection assays using a series of CAT expression vectors containing 
different lengths of 5' upstream desmin region (Li and Paulin, 1991, J. Biol. Chem. 266, 6562-6570). Transgenic mice 
analysis showed that 1 kb of the desmin 5* upstream sequence are sufficient to direct muscle-specific and developmen- 
tatiy regulated expression in skeletal muscles and suggested a distinct regulation for cardiac and smooth muscle 

55 expression (Li etal., 1993, Development 117, 947-959) 

[0008] Sequence analysis of the mouse desmin promoter region also demonstrated the presence of several poten- 
tial c/s-acting elements. Three MyoD binding sites, one MEF-2 binding site, one M-CAT motif and several GC boxes as 
well as a muscle-specific enhancer were characterized within the 976pb preceeding the transcription initiation site (Li 
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and Capetanaki, 1993, Nucleic Acids Res. 21, 335-343). The proximal E box and enhancer MEF-2 binding sites as well 
as their location relative to the TATA box are crucial for high levels expression (Li and Capetanaki, 1994, EMBO J. 13, 
3580-3589). Concerning the hamster desmin gene, Van de Klundert et al (1994, J. Biol. Chem. 269, 220-225) have 
reported the importance of the consensus E box element located in the direct vicinity of the cap site (-80) for promoter 

5 activity whereas the contribution of a second E box situated at -825 is only moderate. 

[0009] Documents of the prior art have focused on the proximal portion of the desmin promoter and disclose its 
capability to induce expression of heterologous sequences mainly in skeletal muscles. The present invention goes 
beyond this teaching in providing regulatory elements specific for expression in arterial smooth cells present within a 
more distal part of the desmin promoter. Recombinant constructions driven by a promoter comprising the region -4006 

io to -2495 of the mouse desmin gene show a specificity of expression restricted to arterial smooth muscle cells. 
Sequence analysis reveals the presence of several potential transcription factors binding sites including 5 CArG-like 
sequences known to regulate many muscle specific genes especially cardiac and skeletal a-actin gene. Furthermore 
sequence comparison between human and murine desmin 5' flanking sequences shows 4 conserved regions between 
the two species, one being located in the arterial smooth muscle specific putative region. This region has special inter- 

is est in cardiovascular diseases field. 

[0010] Accordingly a first object of the present invention is an isolated DNA sequence comprising all or pan of the 
5' flanking region of a desmin gene located between approximately nucleotides -1 0000 to -1 000 relative to the transcrip- 
tion initiation site of said desmin gene or being hybridizable under stringent conditions to said 5' flanking region. 
[001 1 ] The term "isolated DNA sequence" refers to a DNA molecule that is separated from its natural context which 

20 means from its naturally occuring chromosome of the organism from which it is derived. For example, the isolated DNA 
sequence of the invention may be linked to an heterologous coding sequence, inserted into a vector or integrated into 
the genome of an organism. 

[001 2] What is meant by "desmin gene" is fully described in the state of the art and summarized in the irttroductive 
part of the present application. It comprises coding segments that encodes desmin protein and non coding segments 
25 present upstream (5* flanking region), downstream (3* flanking region) and between the coding segments (intron). The 
gene encoding desmin including its 5' flanking region can be isolated from a commercially available genomic library 
using an appropriate probe (i.e. cDNA encoding desmin or a portion thereof) or by other conventional molecular biology 
methods (Polymerase Chain reaction, restriction cutting...). 

[0013] The term "5' flanking region" defines the region that is found upstream of the transcrition initiation site of a 
30 gene. The transcription initiation site is the site at which the first nucleotide of the corresponding transcript is incorpo- 
rated (where RNA synthesis starts). It can be determined by standard methods such as RNase protection assay. In cer- 
tain cases, several transcription initiation sites can be identified for a given gene. In this case, transcription initiation site 
defines the major one which allow the production of the majority (in quantity) of the transcript species. The 5' flanking 
region contains usually the promoter and at least some of the regulatory elements that regulate (induce or repress) 
35 expression of the coding segments under specified conditions. It is within the reach of the man skilled in the art to iso- 
late the specified portion of the desmin 5' flanking region by conventional molecular biology techniques from a genomic 
library, a desmin gene or from a prior art plasmid. 

[0014] The term "approximately" means within a margin of commonly acceptable error for the determination of the 
size of the DNA sequence being made by standard methods (i.e agarose gel electrophoresis...) or within minor varia- 
40 tions for the positions given. A variation of ±1 0% is acceptable according to the present invention. As used hereinafter, 
the positions within the 5' flanking region of a desmin gene are given relative to the transcription initiation site which rep- 
resents nt+1. 

[0015] The term "hybridizable under stringent conditions" is conventional in the domain of the art. Such conditions 
can be found in technical laboratory manuals, such as Maniatis et al., (1982, Molecular cloning, A Laboratory Manual, 

45 Cold Spring Harbor, Laboratory Press, Cold Spring Harbor, NY) or its up to date editions. Preferably, the hybridation 
conditions are according to the ones described by Maniatis (1 982 edition). Three washings are then performed at 65°C 
in the presence of 0,2 SSC et 0.1 % SDS and without formamide, in order to eliminate the non-hybridized fragments. 
These conditions denote a percentage of identity between the DNA sequence of the invention and the defined portion 
of the native 5' flanking desmin region of at least 70%, advantageously 80%, preferably 90% and more preferably of 

so 100%. 

[001 6] For the purpose of the invention, the DNA sequence of the invention comprises one or more regulatory ele- 
ments capable alone or in combination, directly or by means of other cellular factors, to allow expression of a gene of 
interest essentially in muscle cells, advantageously, in vascular smooth muscle cells and preferably in arterial smooth 
muscle ceils and more especially in the aorta and the pulmonary artery. Although background expression can be 
55 observed in other tissues (i. e. other muscle cells, fibroblast...etc), expression is observed in majority in the precited tar- 
get cells. Such a specificity can be determined by in vitro transfection assay in different cell types using a gene whose 
product is easily detectable (LacZ, luciferase, al-antitrypsin, factor IX, CFTR..). Another way to proceed is to generate 
transgenic mice and evaluate the level of transgene transcript or product in the different tissues. 
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[0017] However, it is also feasible to employ a functional equivalent of the DNA sequence of the invention having a 
similar function. Such an equivalent may be obtained by mutation (addition, deletion and/or substitution of one or more 
nt) or truncation of the native desmin 5* flanking sequences. To illustrate this embodiment, one may refer to a DNA 
sequence deleted of the negative control regions. It is also possible to mutate or delete some of the c/s-acting 
5 sequences to improve specificity towards vascular or arterial cells or, optionnaily, decrease specificity towards other tis- 
sues. In addition, the DNA sequence of the present invention may comprise exogenous sequences preferably at its 5' 
or 3' or both 5' and 3* extremities. The capability of the functional equivalent to allow expression of a reporter gene in 
the target cells can be evaluated in vitro or in vivo as indicated in the example part. 

[00181 According to an advantageous embodiment, the isolated DNA sequence of the invention comprises all or 
10 part of the 5' flanking region located between nucleotides -5000 to -1 000 relative to the transcription initiation site of said 
desmin gene. 

[0019] For the purposes of the present invention, the desmin gene in use in the present invention, may be from any 
origin, for example from human, canine, avian, bovine, murine (rat, mouse, hamster...), ovine, feline, porcine or simian 
origin! However, the DNA sequence of the invention preferably derives from a human or a mouse desmin gene. As 
already mentioned, the 5' flanking region of this two genes is cloned and is availble from GenEMBL (under accession 
number Z18892 for the mouse gene and M63391 for the human gene). 

[0020] According to a preferential embodiment, the DNA sequence of the invention comprises all or part of the 5' 
flanking region located between approximately nucleotides -4006 to -2495 relative to the transcription initiation site of 
the mouse desmin gene. In this consideration, it preferably comprises all or part of the sequence shown in SEQ ID NO: 
1 or a sequence being hybridizable under stringent conditions. An appropriate part is constituted by at least 100 conti- 
nous nt of said SEQ ID NO: 1 and, preferably, at least 200 nt. Of course the DNA sequence may include other 
sequences homologous or heterologous relative to the desmin gene. In this consideration a DNA sequence deriving 
from the 5' flanking region located between approximately -4006 to +603 of the mouse desmin gene is preferred. 
[0021] According to another alternative, the DNA sequence of the invention comprises all or part of the 5' flanking 
region located between approximately nucleotides -4000 to -2272 relative to the transcription initiation site of the human 
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desmin gene. 

[0022] Another object of the present invention is an expression cassette comprising a gene of interest placed under 
the control of the elements necessary to its expression in an eukaryotic host cell, said elements comprising at least an 
isolated DNA sequence according to the invention. 

30 [0023] The expression cassette of the invention can includes other elements facilrting maintenance and expression 
of the gene of interest in host cells, especially from myogenic lineage. In particular, such elements can include a pro- 
moter which is operatively linked to the isolated DNA sequence of the invention. Such a promoter is preferably a mini- 
mal promoter comprising the c/s-acting sequences allowing RNA polymerase recognition and binding (TATA box) and 
optionally (if not present in the gene of interest) a transcription initiation site functional in the targeted host cells. 

35 [0024] Such a promoter may be isolated from any gene of eukaryotic or viral origin. A first possibility is to employ a 
promoter derived from a desmin gene and comprised in the 5' flanking region immediately upstream of the transcription 
initiation site and optionally 5* untranslated region of said desmin gene. As far as mouse desmin gene is concerned, 
such minimal promoter is comprised between approximately nucleotide -87 to -1 of the 5* flanking region of the mouse 
desmin gene, but functional variants (mutated, truncated) can also be used. Another alternative is to use a heterolo- 

40 gous promoter. This latter may be choosen among those of the prior art, for example derived from RSV (Rous Sarcoma 
Virus), SV40 (Simian Virus 40). HSV (Herpes simplex virus)-TK (thymidine kinase) or CMV (cytomegalovirus) promoter 
regions, or any promoter especially active in the targeted cell type, especially the minimal promoters of SM22, SM cc- 
actin, SM-MHC, telokn ... etc genes.. 

[0025] In order to stabilize expression, it may be advantageous that the expression cassette comprises at least an 
exon, preferably a non coding exon, and optionally an intron. These splicing sequences may derive from the desmin 
gene, from any eukaryoze gene or from a synthetic sequence. The large variety of splicing sequences described in the 
state'of the art are suitable in the context of the invention. One may dte more particularly those isolated from genes 
encoding a or p globin, apolipoprotein, immunoglobulin, factor IX, factor VIII, CFTR and from pCI vector (Promega). 
They are preferably inserted after the transcription initiation site and before the coding sequences. According to the pre- 
so ferred embodiment referring to the mouse desmin gene, one indicates that an appropriate untranslated exon portion 
extends to approximately nucleotide +79. It is also feasible to include a desmin translated portion to improve translation 
(i.e. the portion extending to approximately nucleotide +603 of the mouse desmin gene) that is then fined in correct 
reading frame to the coding sequence of the gene of interest. 

[0026] According to the invention, an expression cassette using mouse desmin sequence -4006/+603 operatively 
55 linked to a gene of interest is preferred. 

[0027] Finally, the expression cassette of the invention may also include a transcription termination signal (poly A) 
placed downstream of the gene of interest (polyA of SV40. ..). 

[0028] The gene of interest in use in the present invention can be of any origin (prokaryote, eukaryote. viral...) and 
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isolated from a genomic DNA, a cDNA or be of mixed type (minigene). It can encode an antisense RNA, a ribozyme or 
a polypeptide. This latter can be a mature polypeptide, a precursor (i.e. precursor intended to be secreted and compris- 
ing a signal sequence, a precursor intended to be maturated by proteolytic cleavage...), a fragment of a protein (trun- 
cated protein), a chimeric polypeptide originating from the fusion of diverse sequences or a mutated polypeptide 
5 displaying improved and/or modified biological properties. The gene may be isolated from any organism or cell by the 
conventional techniques of molecular biology (PGR, cloning with appropriate probes, chemical synthesis) and if needed 
its sequence may be modified by mutagenesis, PCR or any other protocol. 

[0029] Among the suitable genes of interest, one may cite those encoding a polypeptide selected from the group 
consisting of a cytokine, a chemokine, a membrane or nuclear receptor, a ligand, a clotting factor, a clotting inhibitor, 
10 the CFTR protein, insulin, dystrophin, a growth factor (FGF for fibroblast growth factor, NGF for nerve growth factor, 
VEGF for vascular endothelial growth factor), an enzyme, an enzyme inhibitor, an apoptosis inducer, an apoptosis inhib- 
itor, a cytostatic agent, an apolipoprotein, an oxygen radical scaveyer, a cell-cycle control polypeptide, an antiprolifera- 
tive polypeptide, an angiogenesis inhibitor (angiostatin, endostatin, thrombospondin-L.), an immunogen, a antiviral 
polypeptide, an antibody, an antibody fragment, a toxin, an immunotoxin, a product of a suicide gene, a muscle contrac- 
ts tion inhibiting polypeptide and a marker. 

[0030] According to a preferred embodiment, the gene of interest encodes a product capable to convert a prodrug 
to a cytotoxic drug. One may cite more particularly HSV-TK, which transforms ganciclovir to a guanosine analogue 
which inhibit DNA synthesis in the replicating cells (Caruso and Klatzmann, 1992, Proc. Natl. Acad. Sci. USA. 89, 182- 
186), ricin, a bacterial toxin, the expression product of yeast genes FCYI and/or FURI having UPRTase (Uracile Phos- 
20 phoribosyi Transferase) and CDase(Cytosine Desaminase) activities.... The gene of interest may also code for a 
polypeptide having a lipolytic activity (i.e. lipoprotein lipase ; see Reyner et a!., 1995, Nature Genetics 10, 28 ; Ming, 
1994, J. Biol. Chem 269, 1 1417). It may also code for a cytostatic polypeptide such as the expression product of Rb 
gene (Chang et al., 1995, Science 267, 518-522) or of eNos gene (Hamon et al., 1994, Circulation 90, 1357-1362). It 
can also code for a growth factor of endothelial cells, such as an interleukin. 
25 [0031 ] Another object of the present invention is a vector comprising an expression cassette according to the inven- 
tion. 

[0032] In the context of the present invention, it can be a plasmid, a synthetic or a viral vector. Plasmid denotes an 
extrachromosomic circular DNA capable of autonomous replication in a given cell. The choice of the plasmids in use 
here is very large. It is preferably designed for amplification in bacteria and expression in eukaryotic host cell. Such 

30 plasmids can be purchased in a variety of manufactors. Suitable plasmids include but are not limited to those derived 
from pBR322 (Gibco BRL), pUC (Gibco BRL), pBluescript (Stratagene), pREP4, pCEP4 (Invitrogene), pCI (Promega) 
and p Poly (Lathe et al., 1987, Gene 57, 193-201). It is also possible to engineer such a plasmid by molecular biology 
techniques (Maniatis et al., 1989, Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY). 
A plasmid may also comprise a selection gene in order to select or identify the transfected cells (by complementation 

35 of a cell auxotrophy, antibiotic resistance), stabilizing elements (i.e. cer sequence; Summers et Sherrat, 1984, Cell 36, 
1097-1 103) or integrative elements (LTR viral sequences). Furthermore, it can be complexed or associated to synthetic 
vectors (i.e. liposome, cationic lipids, polymers...). 

[0033] An expression vector may also be from viral origin and may be derived from a variety of viruses, such as her- 
pes viruses, cytomegaloviruses, AAV (adeno-associated virus), poxviruses (canarypox, fowlpox, vaccinia virus, MVA) 

40 and retroviruses. However, a vector which is specially suitable for the present invention is an adenoviral vector. 

[0034] For the purpose of the invention, the adenoviral vector of the invention is preferably replication-defective by 
deletion of at least ail or part of the E1 region. The deletion encompasses at least El a sequences and may extend in 
the E1 b transcription unit. Preferably the E 1 b sequences overlapping with pIX gene are not deleted. It goes without say- 
ing that the adenoviral backbone of the vector in use in present invention may comprise additional modifications (muta- 

45 tion/deletion/insertion) allocated to one or more viral genes. A partial or total deletion of E3 region may be 
advantageous. 

[0035] A vector of the invention may further comprise an origin of replication efficient in arterial cells, especially 
smooth muscle cells and/or sequences allowing homologous recombination in the host which can direct replacement 
of a defective gene by the gene of interest in use in the present invention. 

so [0036] Another object of the present invention is also a viral particle comprising an expression cassette or a vector 
according to the invention. It may be prepared according to any conventional technique in the field of the art. When an 
adenoviral vector is considered, one may proceed by cotransfection of suitable adenoviral fragments in a cell line such 
as 293 line (as described in Graham and Prevect, 1991, Methods in Molecular Biology, Vol 7, Gene Transfer and 
Expression Protocols ; Ed E J, Murray, The Human Press Inc, Clinton, NJ). It is also possible to reconstitute the vector 

55 in Escherichia coli by ligation or homologous recombination (as described in W096/1 7070) before transfecting the cell 
line. Furthermore, the virions may be amplified by passage in a permissive cell in order to generate a high titer viral 
stock that may be used in the preparation of clinical lots. It may be propagated in a complementation cell line, which 
supplies in trans the deleted/mutated viral functions. Line 293, established from human embryonic kidney cells (Gra- 
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ham et al., 1977, J. Gen. Virol. 36, 59-72) is commonly used to complement the E1 function. Alternatively, it is also pos- 
sible to use a helper virus supplying in trans some of the viral deficiencies. The viral particles may be recovered from 
the culture supernatant but also from the cells which can be lysed by a series of thawing/freezing cycles. Optionally, the 
virions may be amplified and purified according to the standard techniques (chromatography, ultracentrifugation for 
5 example in cesium chloride gradient....). 

[0037] The subject of the present invention is also an eukaryotic host cell that contains an expression cassette or 
a vector according to the invention or is infected by a viral particle according to the invention. Preferably, such a cell is 
from mammalian origin and, in particular from human origin. The vector may be inserted into the cellular genome or not 
(episome). Suitable cells include but are not limited to primary or tumoral cells from any origin (haematopoietic, muscu- 
lo lar, lung, tracheal, liver, epithelial, fibroblastic or endothelial origin). Advantageously, an eukaryotic host cell is a muscle 
cell, preferably a smooth muscle cell and especially a vascular smooth muscle cell. Among these latters, arterial smooth 
muscle cells are absolute preference and especially from aorta and pulmonary artery. 

[0038] The subject of the invention is also a composition comprising an expression cassette, a vector, a viral parti- 
cle or an eukaryotic cell according to the invention. It may further comprise one or more substances improving gene 

is transfer efficiency or stability. Such substances are well known in the art (see for example Feigner et al., 1987, Proc. 
West. Pharmacol. Soc. 32, 115-121; Hodgson et Solaiman, 1996, Nature Biotechnology 14, 339-342; Remy et al., 
1994, Bioconjugate Chemistry 5, 647-654, W094/27238, WO95/10534) and include in particular polymers, polypep- 
tides, cationic lipids, liposomes, nuclear proteins and neutral lipids. They may also be used in combination (i.e. cationic 
and neutral lipids). Such a substance may also facilitates gene delivery for a particular application. Examples of sub- 

20 stances susceptible to facilitate transfection in arterial cells include a gel of a complex of pply-lysine and lactose 
(Midoux et al., 1993, Nucleic Acids Res. 21, 871-878) or poloxamer 407 (Pastore, 1994, Circulation 90, 1-517). 
[0039] According to a preferred embodiment, the composition of the invention is a pharmaceutical composition 
comprising the precited agents and a pharmaceutical^ acceptable vehicle. It is intended especially for the preventive 
or curative treatment of genetic diseases (haemophilia, diabete, cystic ffcrosis, Duchenne or Becker myopathies, auto- _ 

25 immune diseases...) or acquired disorders (cancers, tumour, cardiovascular diseases, viral diseases such as AIDS or 
hepatitis B or C....). A preferred application is the prevention and treatment of cardiovascular disorders. 
[0040] The composition according to the invention may be manufactured in a conventional manner for localrgeneral^ 
or oral administration. Aerosol, inflation, injection as well as infusion and graft techniques may be envisaged. Suitable 
routes of administration include subcutaneous, intracardiac, intramuscular, intravenous, intraarterial, intraperitoneal, 

30 intratumorai, intrapuimonary, intratracheal routes.The administration may take place in a single dose or a dose repeated 
one or several times after a certain time interval. The appropriate administration route and dosage vary in accordance 
with various parameters, for example, with the individual, the disorder to be treated or with the gene of interest to be 
transferred. The viral particles according to the invention may be formulated in the form of doses of between 10 4 and 
10 14 iu (infectious unit), advantageously 10 5 and 10 13 iu and preferably 10 6 and 10 12 iu. The titer may be determined by 

35 conventional techniques. Doses based on vector may comprise between 0,01 and 100 mg of DNA, advantageously 
0,05 and 10 mg and preferably 0,5 and 5 mg. In addition, the agent of the composition may be combined with a vehicle 
which is acceptable from a pharmaceutical standpoint. The formulation may also include a diluent, an adjuvant or an 
excipient. It may be presented as a liquid directly injectable or as a dry powder (lyophylized ... etc) that can be reconsti- 
tuted before use. 

40 [0041] As far as cardiovascular diseases are concerned, the expression cassette, vector or viral particle may be 
delivered in vivo by specific means adapted to this pathology. One may use balloon catheter that will be inflated in the 
affected region (Feldman and Steg, 1 996, Medecine/Science 12 t 47-55). It is also feasible to deliver them dircetly to the 
arteries following surgical operation. Another possibility is to introduce along the affected artery a grid frame impreg- 
nated with the therapeutic agent (Feldman at al., 1995, J. Clin. Invest. 95, 2662-2671). Graft of smooth muscle cells are 

45 also possible. 

[0042] Another object of the present invention is the use of an expression cassette, a vector, a viral particle or an 
eukaryotic cell according to the invention to the manufacture of a drug intended for gene transfer into a host cell or 
organism and preferably for the treatment of human or animal body by gene therapy. A preferred use is the treatment 
or prevention of a cardiovascular disease, more especially disorders selected among the group consisting of restenosis 
so following balloon angioplasty, isehemia, intimal hyperplasia, artheroselerosis, hypercholesterolemia and smooth mus- 
cle cell proliferation. 

[0043] The invention also extends to a method of treatment according to which a therapeutically effective amount 
of an expression cassette, a vector, a viral particle or an eukaryotic cell according to the invention is administered to a 
patient in need of such a treatment. 
55 [0044] Another object of the present invention is a transgenic animal, especially a transgenic mouse, comprising an 
expression cassette or a vector according to the invention. Such an animal can be generated by conventional transgen- 
esis methods and may be used as a model to study the potential effect or activity of the gene of interest or the regulation 
of the DNA sequence of the invention. 
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[0045] The invention also concerns the use of a DNA sequence comprising all or part of a sequence located 
between approximately nucleotides -10000 to +600 of a desmin gene to target expression of a gene of interest opera- 
tively linked to said DNA sequence in a muscle cell selected among the group consisting of cardiac, skeletal and 
smooth muscle cell. A sequence comprised within -4006 to +603 of the mouse desmin gene is preferred. The use of a 

5 subfragment (i.e. -10000/-1000 or -4006/-977) operatively linked to an appropriate promoter is also possible 

[0046] Finally, the invention concerns the use of a DNA sequence comprising all or part of a sequence located 
between approximately nucleotides -5000 to -2400 of a mouse desmin gene to target expression of a gene of interest 
operatively linked to said DNA sequence and to a minimal promoter in an arterial smooth muscle cell and more espe- 
cially in the aorta and the pulmonary artery. A sequence comprised within -4006 to -2495 of the mouse desmin gene is 

io preferred.. 

[0047] The present invention is illustrated without being limited by the following examples. 

Figure 1 illustrates a comparison of the sequence of the murine and human desmin promoters. Highly conserved 
regions are shaded. Binding sites for transcription factors are boxed. Transcription start sites are indicated by 
is arrows and numbered +1 (second site for the human sequence). 

Figure 2 illustrates the sequence of the arterial SMC specific Desmin [-4006/-2495] region. The putative con- 
sensus binding site for transcription factors are boxed. Only the perfect consensus are shown for clarity except for 
the CArG-like sequences that diverge of I nucleotide on 10 compared to the CC(A/T) 6 GG consensus. 

20 

EXAMPLES 

[0048] The constructions described below are carried out according to the general techniques of genetic engineer- 
ing and molecular cloning detailed in Maniatis et a!., (1989, Laboratory Manual, Cold Spring Harbor, Laboratory Press, 
25 Cold Spring Harbor, NY) or according to the manufacture's recommendations when a commercial kit is used. Regarding 
the repair of restriction sites, the technique employed consists of filling in the 5' protuding ends using the large fragment 
of E. coil DNA polymerase 1 (Klenow). 

MATERIAL AND METHODS 

30 

[0049] The position within the desmin gene and the restriction sites positions (first nt in 5' of the restriction 
sequence) are given relative to the transcription initiation site (+1 ) of the desmin gene. 

Plasmid pE608 was constructed by insertion of a fragment EcoRI (-4005)-Sall (+603) upstream to the marker gene nls- 
lacZ. The nt G located in 5' of the EcoRI site of the pBluescript ks polylinker is similar to the nt G in position -4006 of 
35 the desmin gene. The Sail site (+603) of the desmin is ligated to Ncol site of nls-LacZ sequence after filling in by klenow 
polymerase, leaving the Sail site reconstructed whereas Ncol site is lost. The desmin sequence is conserved up to 
position +608 and residue Asp in position 176 is in the correct reading frame relative to the Met residue of nls-lacZ 
expression products. 

[0050] Plasmid pEBB608 was obtained by inserting the blunt ended EcoRI (-4005) - Bst Ell (-2500) fragment into 
40 BamHI digested and blunt ended pE608 (BamHI sites 5' polylinker and in position -87). As a result the fragment -4004/- 
2495 is fused to nt -86. Plasmid pEXE608 was obtained by inserting the blunt ended Xbal (5* polylinker and -3226) in 
pE608 deleted of fragment Eagl (5* polylinker and -243). In the resulting construct, the sequence -4006/-3222 is fused 
to position -242 of the desmin sequence. 

[0051] To generate pE78, pE608 was deleted of fragment BstEII (-2500)-Sall (+603). The deleted fragment is 
45 replaced by a PCR fragment obtained with a sens oligonucleotide overlapping sequence -2503/-2480 (containing BstEII 
site) and a reverse oligonucleotide overlapping sequence +78/+22, including a Sail site at its 5* extremity and eliminat- 
ing initiator ATG (+81/+83) of the desmin gene. The desmin sequence GCCACCAEQ is replaced by GCCAGTCGAC- 
C ATG including a Sail site. 

so Nuclear extracts 

[0052] All procedure were performed at 4°C. All buffers contain cocktail of inhibitors (SIGMA). Cell layer was 
washed twice and scraped in cold PBS. Cell pellet was resuspended in buffer containing 10 mM HEPES pH8, 50 mM 
NaCI, 500 mM sucrose, 1 mM EDTA, 0.5 mM spermidine, 0.15 mM spermine and 0.2 % triton X100 for 15 min. Cells 
55 were lysed with 0.2 % final NP40, 1 min and pelleted 3 min at 2800g. Nuclei pellet were resuspended once in buffer 
containing 10 mM HEPES pH8, 50 mM NaCI, 20 % glycerol, 1 mM EDTA, 0.5 mM spermidine, 0.15 mM spermine. After 
3 min centrifugation (2800 g), nuclei pellet was resuspended in an equal volume of the same buffer except that salt con- 
centration was 350 mM NaCI. To account for pellet volume dilution, 5 M NaCI was added to adjust final NaCI concen- 
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tration to 350 mM. The resuspended pellet in hypertonic buffer was rotated for 30 min before to be centrifugated 30 min 
(18300 g). Total protein content of the crude nuclear extract supernatant were quantified by the method of Bradford 
(Biorad assay) and was usually comprised between 3 to 5 mg/ml. Most of the time, crude nuclear extracts were directly 
used for EMSA. For footprint analysis, crude unclear extracts were concentrated by ammonium suKat precipitation 
before to be dialysed 4h in buffer containing 10 mM HEPES pH8, 60 mM KCI, 20 % glycerol, 0.25 mM EDTA, 0.125 mM 
EGTA, 1 mM PMSF, 1 mM DTT. 

EMSA 

Oligonucleotides used in EMSA analysis 

SRE 5' GGATGTCCATATTAGGACATCT 
mSRE 5' g c 



Elements CArG de la Desminc CArG Position 



CArCl 


5' 


AGGATCAGTACATATAAGGCATTATTGT 


-1390 


CArG2 


5' 


CATCAAGGACCTTTAGAGGAAAAAGGAG - 


^2572 


CArG 3 


5* 


CTAATAAAGCAAATAAAGGCCAATTTTA 


-3443 


CArG4 


5 1 


GCA A A AC ACCC AT A T ATGT AAAATAATT 


-3694 


mCArG4 


5 r 






CArG 5 


5' 


TTTAGAAAACCTTTATAGTCCAACCCCC 


-3910 


CArG6 


5 1 


CCTGGAATTCCTTTTTATGCAAATGAGC 


-4000 


fnCArGG 


5' 






Oligonucleotides de SM22 




SM-CArG1 


5' 


TGTCTTTCCCCAAATATGGAGCCTGTGT 


-150 


SM»CArG2 


5 f 


GGGTCCTGCCCATAAAAGGTTTTTCCCG 


-273 


SM-TG 


5' 


AGAAGGTGGCTTGTTTGTCCCTTGCAGGTTCC 


-399 



[0053] Each CArG-like oligonucleotides was radiolabeled and 12 f moles of 32 p oligonucleotide was incubated with 
4 ng of AUI VSMCS crude nuclear extracts. Competition was made by adding 50x molar ratio of cold oligonucleotide of 
the same time than the radiolabeled probe. 

RESULTS 

4006 bp of the murine desmin promoter are sufficient to direct cell specific expression of lacZ in cardiac, skel- 
etal and vascular smooth muscle cells during embryogenesis. 

[0054] Reporter genes used in transfection experiments were directly used after linearization to generate trans- 
genic mice that were examined for lacZ expression as founders and stable transgenic lines. We first introduced the 
larger construct extending from -4006 to +608 bp, namely MDes 4kb-nlz (pE608), to determine what tissue specific ele- 
ments were contained in that sequence. MDes 4kb-nlz showed to contain sufficient cis-regulatory sequences to direct 
tissue specific expression of desmin in the three types of muscle, i.e. striated cardiac and skeletal muscles and non- 
striated smooth muscle cells (SMC). 
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Cardiac expression 

[0055] Whole amount X-Gal staining for lacZ expression revealed that transgene expression was first detected at 
E7.5 in the late primitive streak embryo. The staining was localised in the bilateral precardiac mesoderm so approxi- 

5 mately at the time that the mesoderm cells of this region are fated to cardiac lineage. At E8.5 the transgene was 
expressed in afl regions of the S-shaped primitive heart with an equal intensity. The staining marked marked the outflow 
tract, the buibus cordis, the common ventricle, the common atrial chamber and the sinus venosus. Sections of E9.5 
embryos showed that the lacZ positive cells were found in the pericardium and the trabeculated myocardium but not in 
the endocardium. Desmin-LacZ transgene was never expressed in the endocardial cushion that forms the separation 

w of the atrial and ventricular chambers nor in the fibroblastic tissue of the valve leaflets as shown at E1 7.5. At each stage, 
the transgene expression reproduced endogenous desmin expression in the heart as demonstrated by co-localization 
of the LacZ staining and the immunoperoxydase signal decorating the desmin filaments. In conclusion, desmin trans- 
gene as the endogenous protein is specifically expressed in the muscle cells of the heart. High level of expression in 
the heart persisted throughout adult life. 

15 

Vascular Smooth muscle expression. 

[0056] The MDes-4kb-LacZ transgene was expressed very early in the smooth muscle cells layer immediately 
adjacent to the endothelium starting at E9.0, so one day after the appearance of the paired aortae. The paired aortae, 

20 the umbilical arteries and the vitelline artery all started to show lacZ staining around the same embryonic stage E9.0. 
In embryos with 20-22 somites one could see the lacZ staining in the aorta in a double gradient pattern. The strongest 
staining (or higher number of positive blue nuclei) was observed in both the more rostral paired aortae and the more 
caudal umbilical arteries. Bu contrast in the trunk region where the right and left aortae have fused in a single larger 
vessel, very few positive nuclei were seen at this early stage and the large majority of them were located on the ventral 

25 side of the vessel. From stage E10.5, the expression of the Mdes-nlz transgene increased in the dorsal aorta so that 
the aorta was stained all along its length including the dorsal side. At E12.5, the pattern of expression in the dorsal aorta 
was regular following the anteroposterior axis. X-Gal staining marked the carotid arteries and the basilar artery in the 
head at E1 1 .0. Aorta and all main arteries were stained at E15.5 and the transgene continued to be expressed in the 
arteries throughout adult life. 

30 A transitory expression was observed in the aortic arch arteries that followed the timing of their appearance and trans- 
formation into various vessels derivatives. In general less nuclei were stained in the aortic arches compared to the wall 
of the adjacent paired aortae. Staining in the first aortic arches was observed from E9.0 to E9.5. At this stare the stain- 
ing was more evident in the second aortic arches and was not visible anymore in E11.0 embryos with 35-40 somites. 
Third aortic arches began to be stained at E9.5 and number of positive nuclei increased until E11 .0. At this later stage, 

35 the fourth aortic arches displayed much more positive nuclei than the third aortic arches and only the branching point 
from the aortae to the sixth aortic arches were stained. In all embryos that were observed at stage El 1.0 the aortic 
arches on the right side of the embryo were less stained than on the left side. 

Delayed expression of the transgene in the veins. 

40 

[0057] Despite the fact that the main veins like the cardinal veins or the umbilical veins are formed at around the 
same stage than the arteries, we observed a delay in venous SMCs transgene activation ranging from one to 4 days 
compared to the arterial SMCs transgene initial expression. Transgene expression in the venous system was first 
detected in the umbilical veins at E10.5 and at slightly more advanced stage in the vitelline veins but was not clearly 

45 visible before E11 .0 to E1 1 .5 whereas umbilical and vitelline arteries were already strongly stained at E9.5. During afl 
embryonic development the venous vitelline and umbilical vessels displayed a rather sparse pattern of positive nuclei 
very different from the dense pattern of positive nuclei in the parallel arterial system including the yolk sac vasculature. 
Internal large veins like the anterior and posterior vena cavae or the hemyazygos vein were not stained before stage 
E13.5 and were clearly visible at E14.5. Expression of the transgene in the SMCs of the external venous vasculature 

so followed the same timing of activation than the interna! veins. 

Skeletal muscle expression 

[0058] Somite activate MDes-nlz expression in the myotomal compartment as soon as E9..0 following a rostro-cau- 
55 dal gradient reflecting the delayed maturation of the most posterior somites. For instance the last five somites in a 31 
somites embryo were not stained. Later the staining was found in the dermomyotomal cells including the thoracic der- 
momyotomal buds that elongate and invade the ventral body as in the more medial cells forming the axial myotome. 
Staining was also found in the muscle progenitor cells which had migrated from the somite to limb buds as soon as 
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E10.5. Staining intensified in the differentiating myofibers from E12.5 to later stages in the axial muscles, ventral body 
wall muscles and craniofacial muscles. Ocular muscles appeared to be stained from stage E9.5. Presumably all skel- 
etal muscles were stained by the p-gal activity at E15.5. 

5 Nervous system 

[0059] Beginning at E9.5, the transgene was also expressed in the head externally to the lateral walls of the mes- 
encephalic vesicle. This expression pattern extended anteriorly in the telencephalic vesicles at E12.5 but posteriorly the 
fourth vesicle stayed negative for lacz expression. However in the neural tube there was a paired region located in the 
io ependymal layer on each side of the central canal which showed transitory expression of p-gal from E9.5 to E12.5 
where it was restricted to the caudal pan of the neural tube. 

Extra-embryonic expression pattern 

is [0060] As early as stage E6. 5 and later on, lacZ expressing cells were also found in the surrounding decidual cells 
that are from maternal origin. Other sites of extra-embryonic expression were mainly the yolk sac vasculature some 
sparsed strongly positive cell in the amnion which did not form a vascular network. 

[0061 ] These results demonstrate that the MDes-nlz construct including 4006 pb 5' to the transcription initiation site 
and 608 bp more of the exon 1 , bears all the necessary cis-acting elements able to target tissue specific transcription 
x in cardiac, skeletal and vascular smooth muscle cells but the elements necessary for visceral smooth muscle. Also 
these results show a delayed activation of the desmin transgene in the venous SMCs compared to the arterial SMCs 
suggesting transcriptional SMC gene regulation as basis for morphological differences observed between the two types 
of vascular SMCs. 

25 A distal 1512 bp desmin promoter fragment (pEBB608) directs arterial but not venous SMCs expression 

[0062] A pEBB608 construct was derived from the MDes4kb-LacZ (pE608)s construct by deletion^ an internal ^ 
fragment [-2494/-S7] covering the skeletal enhancer region and the downstream half part of the second conserved?/ 
region with the human promoter (located between -3168 and -1960 of the murine gene). The pEBB608 transgene was 

30 analyzed in founder embryos at E15.5. Actually one embryo on three showed a clear dorsal aorta and intercostal ves- 
sels staining. Deletion of the fragment [-2494/-87] notably resulted in loss of staining in the heart except for a small pop- 
ulation of cells at the outer surface compared to a MDes-nlz embryo at the same stage. A faint specific staining was 
observed in some of the skeletal myofibers despite the absence of the skeletal enhancer. This could be attributed to the 
presence of an E-box at position -80, that was shown to confer low level myotube-specrfic transcription activity. The neu- 

35 ral tube staining pattern was slightly different in some founder embryos suggesting that the deleted desmin promoter is 
susceptible to neural tube cell expression without demonstrating a true specificity. 

[0063] Noteworthy was the fact that not ail the vessels were stained compared to an equivalent staged MDes4kb- 
nlz (pE608) embryo. Particularly, histological section analysis of the founder embryo showed that only the arterial SMCs 
of the aorta and the pulmonary artery but not the SMCs of the vena cava. No other vessels were stained by the X-gal 
40 reaction suggesting that the region still present in the deleted transgene are only active in the arterial SMCs of the aorta 
and the pulmonary system. This observation correlates with an independant regulatory program in the arterial and the 
venous SMCs. 

Promoter sequence analysis 

45 

Sequence Comparison of Murine and Human Desmin Promoter 

[0064] A DNA sequence comparison analysis was performed between the human and the murine desmin gene in 
order to identify the potentially important regulatory regions that would be conserved in the both species. The promoter 

so sequences of the murine (-4447/+69S) and the human (-4606/+88) desmin gene were compared using a window strin- 
gency algorythm (Compare, GCG Wisconsin University) and a parcimony analysis sequence comparison (Bestfit. QCG 
Wisconsin University). The dot plot analysis of comparison using the Compare algorythm showed four well conserved 
segments in the 5' flanking regions (Figure 1 , where the conserved regions are shaded) The two first regions lie imme- 
diately upstream of the initiation site of transcription and display a high level of nucleotide sequence conservation until 

55 -1437 upstream for the murine gene and -1542 for the human gene. The proximal promoter sequences include several 
conserved transcription factor binding sites like the MyoD I site lying -80 bp upstream of the transcription start site or 
the Sp1 site at -128 bp for the murine gene. Maximal sequence identity (98 %) was found in the region of the previously 
identified 275 bp long human skeletal muscle enhancer marked by brakets which is divided in two regions, i.e. the myo- 
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tube enhancer containing two Mt sites and the MEF2 binding site adjacent to the MyoD f site, the myoblast enhancer 
containing binding sites for Krox20724 and Spl factors (Li et al.71993, J. Biol. Chem. 268, 10403-10415). A second large 
fragment of sequence conservation was found between -3168 to -1960 bp in the murine promoter and - 2940 to -1682 
in the human promoter. Two regions display higher level of sequence identity, the first extending from -2196 to -1960 in 
5 the murine sequence which did not contain conserved consensus binding sites for known transcription factors, the sec- 
ond loss well conserved extending from -2946 to -2648 in which consensus binding sites for NFI, AP-1 and HNF-5 were 
conserved between the two species. 

Arterial SMC specific sequence analysis 

10 

[0065] Transgenic animals generated with a construct using the 1512 bp sequence between -4004 and -2495 (Fig. 
2) that is present in the transgene pEBB608 show an arterial SMC specificity. CArG box motifs are known to regulate 
many muscle specific gene, especially cardiac and skeletal a-actin gene, via interaction with SRF (Serum Transcription 
Factor) or other regulatory factors like Nkx 2.5 in the case of the cardiac a-actin gene. Of interest is the fact that 

is amongst 6 CArG-like sequences (90 % of sequence consensus to CC(A/T) 6 GG) present in the 4006 bp upstream 
region, 5 are located in the aortic SMCs specific putative region. We noticed that the CArG-2 motif at position -2572 in 
the murine sequence is not conserved in the human sequence whereas the slightly divergent CArG-l (ACATATAAGG) 
at position -1 390 in the murine sequence is conserved at 90 % in the human -1 479 position but with an even more diver- 
gent sequence (ACATGTAAGG) that makes it unlikely to bind SRF. All the 4 other CArG-like motifs in the murine 

20 sequence from CArG-3 to CArG-6 lie in a 600 bp long region between -4000 and -3400, so outside the conserved 
region with the human gene. Other perfect consensus for known transcription factors are shown in figure 2. Interestingly 
some of them were involved in smooth muscle cell gene regulation. Sp1 is known to regulate positively elastin and 
PDFGB-R gene in VSMCs and negatively the Sm-MHC gene. The Sp1 zinc finger factor is largely expressed in many 
tissues, however different expression level was noticed in pups versus adult rat VSMCs where the expression levels 

25 were reduced in quiescent SMCs but reinduced in proliferating PDGFB-Rpositive, SM-MHC negative SMCs following 
arterial wall injury. 

Electrophoretic mobility shift assay (EMSA) analysis of the murine CArG box sequences 

30 [0066] Given the known involvement of the SRF transcription factor in the arterial SMC-specif ic gene regulation and 
the cluster of the CArG motifs in the putative arterial SMC-specif ic region of the desmin promoter, we decided to search 
for potential true SRF binding sites amongst the candidate sequences. EMSA analysis was performed on the CArG 
boxes of the murine desmin gene. The 6 previously mentionned CArG boxes were chosen an the basis of a 90 % con- 
served nucleotide sequence relatively to the CC(A/T) 6 GG consensus and the conjoined presence of at least one dou- 

35 blet CC or GG at the extremity of the motif. Nuclear extracts from the AUI cell line were used for these experiments. AUI 
clone is derived from primary human aortic SMCs immortalized by a T-antigen SV40 promoter drived expression (Dr. 
Dandree). Those cell line express high level of endogenous desmin amongst other SMC-specif ic marker, a character- 
istic rather rare in the existing SMCs cell lines, 

40 CArG-4 box located at position -3694 bind a SRF-iike molecule 

[0067] In a first experiment all 6 CArG-boxes were radiolabeled with 32 P isotope and incubated with AUI nuclear 
extracts. Only CArG-4 and CArG-6 displayed one (C6) or two (C4) slowly migrating complexes of compatible size with 
a SRF containing complex. Mutations of the 5' CC doublet to CG and the 3' TG to TC in both the mCArG-4 and the 

45 mCArG-6 oligonucleotides abolished the binding of the slowly migrating complexes. To further elucidate the nature of 
the factors binding to the CArG-4 and CArG-6 motifs, known target sequences for SRF were tested in competition 
EMSA analysis. Competition was made by adding 50X molar ratio of cold oligonucleotide at the same time than the 
radiolabeled probe. CArG-4 probe displayed at least 2 specific slowly migrating complexes denoted A and B that were 
abolished by self-competition and not abolished by a mutated C4 oligonucleotide, and 2 faster complexes that were only 

so partly inhibited by self-competition. Formation of the A-complex was competed by the c-fos SRE, or the SM22 CArG-1 
and CArG-2 sequences but not by mutated SRE oligonucleotide, nor by the desmin CArG-6 or an unrelated SM22-TG 
oligonucleotide. The B-complex was only partly competed by CArG-6 and by the SM22 CArG-1 and CArG-2 
sequences. Radiolabeled CArG-6 bound one slow-migrating of similar size than the B complex with CArG-4 but more 
intense and two faster migrating complex. The B complex was specifically competed by itself and by CArG-4 but not by 

55 mC6, mC4 or the SM22 CArG boxes, nor by SRE or SM-TG. These results suggested that SRF or an SRF-like molecule 
is present in the CArG-4 A complex and possibly in the CArG-4 B complex whereas the CArG-6 B complex was devoid 
of an SRF molecule. Supershift analysis of the desmin CArG-4 and CArG-6 confirmed this hypotheses. Preincubation 
of 4 \ig of AUI VSMCs crude nuclear extracts with 2 jil of SRF antibody directed against the C-terminus of die molecule 
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(Santa Cruz sc-335X) upshifted the CArG-4 A-complex and the SRE complex migrating at the same size. We noticed 
that intensity of the bands in the upshifted complexes was decreased suggesting reduction of the SRF-DNA binding 
affinity by the specific antibody. Those complexes were not upshifted by the pre-immune serum as a control. In the case 
of CArG-4, the SRF antibody supershifted complex appeared as a doublet and the intensity of the B complex was 

5 slightly decreased suggesting that it contained a SRF molecule. The B-complex formed with the CArG-6 oligonucleotide 
was not supershifted by the SRF antibody nor reduced in signal intensity, showing that this CArG-6 complex did not con- 
tain SRF. The same experiment of upshift analysis by the SRF antibody was made using 5 jig of C2.7 myotubes dia- 
lysed nuclear extracts. The same results than with the AUI VSMCs nuclear extract were obtained showing that the SRF 
containing A and B complexes and the CArG-6 B-complex were present in both cell types. 

10 These results showed that the CArG-4 is able to bind a SRF-like molecule as well as another factor also bound by the 
CArG-6. Interestingly the CArG-6 motif overlapped a perfect ATGCAAAT consensus for Oct factor binding site and this 
sequence was partially conserved in the same place in the CArG-4 box with the sequence ATGTAAAA. The slightly 
divergent sequence in the CArG-4 box relative to the consensus should also explain the weaker affinity of the factor for 
the CArG-4 compared to the CArG-6 oligonucleotide. Moreover the same mutation that abolished the formation of the 

/5 SRF/CArG-4 complex also abolished the formation of the B complex with both the CArG-4 and the CArG-6 oligonucle- 
otides and one of this mutation changed the ATG5* part of the Oct-consensus in ATC. 

DISCUSSION 

20 Vascular Smooth muscle gene regulation 

[0068] The MDes4kb-LacZ (pE608) transgene displayed specific activity in vascular SMC expression but not in vis- 
ceral SMCs. A fragment of 1512 bp between -4004 and - 2495 when directly fused to a minimal desmin promoter at 
position -86 is able to drive lacZ expression in aortic and pulmonary arterial SMCs but not in venous SMCs at E1 5.5 in 
25 a founder embryo. All together these results reinforce the idea that different regulatory programs control gene regulation 
even from one single gene in the different SMC sub-types. Here we show that the MDes-4kb (pE608) transgenic line 
expressed lacZ in both the venous and the arterial SMCs and the deleted [-2494/-87) transgene expressed lacZ in a . 
subset of arterial SMCs but not in venous SMCs. 

The SMC regulatory program is likely to be slightly different between the murine and the human promoter. One first line 
30 of evidence is that CArG-4 box which binds SRF in the murine desmin promoter lies outside the conserved region with 
the human gene. 

Vascular smooth muscle ontogenic processus highlighted by the desmin-lacZ transgene expression 

35 [0069] The LacZ staining in the aorta appears at E9.0 in a double gradient pattern from each rostral and caudal 
paired vessel end. The number of positive nuclei decreases sharply in the trunk region where the right and left aortae 
have fused in a single larger vessel. Those nuclei are mainly located to the ventral side of the aorta. The preferred ven- 
tral localization of the positive nuclei in the developing fused aorta of the mouse embryo could be similar to the early 
congregation of mesoderm cells to the ventral aortic endothelium of the quail embryo and subsequently the initial 

40 expression of SM a-actin and IEI2 antigen in the ventraly located SMCs (Hungerford et ah. 1996). Finally the MDes4kb- 
LacZ (pE608) transgene delayed expression in the venous SMCs compared to the arterial SMCs is noticeable consid- 
ering the reduced number of smooth muscle cells layer in the veins compared to the arteries. Relative hypomorphic 
venous SMC development could relate to weaker activation of contractile SMC specific gene promoters. 

45 Skeletal muscle 

[0070] From U et al. (1993a, J. Biol. Chem. 266, 10403-10415; 1993b Development 1 17, 947-959), the regulation 
of the desmin gene in the embryonic myotomal precursor cells of the skeletal muscle is clearly understood and is due 
to a bipartite enhancer located between position -973 and -693 in the human sequence composed of a myotube specific 
so enhancer with a MEF-2 binding site and a MyoD binding site and a myoblast specific enhancer binding Krox and Sp1 
family members of transcription factor. This enhancer is highly conserved between in all species for which the sequence 
is known (hamster, rat, mouse). 

However, any of the human or the murine transgene containing only 5' flanking sequence were able to drive expression 
in the adult skeletal muscles that stopped between birth and two weeks depending on the transgenic lines with residual 
55 expression in-few dispersed nuclei of some muscles in the adult. Only when intronic and coding sequencex were added 
to the human transgene, adult skeletal muscle expression was maintained. This result suggests that there is a shift of 
skeletal muscle regulatory programm between embryonic and adult life, the former being mainly regulated by the skel- 
etal S'-located skeletal enhancer and the latter requiring an independant, or alternatively in addition to this enhancer, 
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regulatory elements located wether in introns or exons or both. 
Heart expression 

s [0071 ] Desmin is expressed very early in the cardiomyocytes precursors arising at E7.5 on each side of the anterior 
ventral mesoderm. Afterwhat, the gene expression is maintained at high level throughout the embryonic period as well 
as the adult life. MDes4kb»LacZ transgene recapitulated this pattern of expression in one transgenic line, while another 
transgenic line only displayed right ventricle expression. In contrast to the decrease and arrest of expression of the dif- 
ferent transgenes in the skeletal muscles of the adult mice, the transgenes whether it were from human or murine 

10 desmin gene origin, when they were expressed in the embryonic heart, the expression was maintained in the adult 
heart. 
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SEQUENCE LISTING 



(1) GENERAL. INFORMATION: 

(i) APPLICANT: 

IA) name: universite Paris vn 

(B) STREET: 2 place Jussieu 
tC) CITY: Paris cedex 05 

IB) COUNTRY: France 

(F) POSTAL CODE (ZIP): 75251 

(ii) title OF invention: Vascular specific regulatory elements 
contained in Lhe desmin 5* flanking region- 
al > NUMBER OF SEQUENCES: 1 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
<B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentm Release #1.0, Version #1.25 (EPO) 



(2) INFORMATION FOR SEQ ID NO; 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1512 base pairs 

(B) TYPE : nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ill! HYPOTHETICAL: NO 

<iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Mouse 



; . (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GGAATTCCTT TTTATGCAAA TGAGCCATTC CCAGCACCCT AGCCCCACCA ATCCTGTACT 
60 

CJW5ATAGCC3C. CAACCATCCC TAGAGGTTTT AGAAAACCTT TATAGTCCAA CCCCCAACTA 120 
AATGTGCTAC TTTCTGAATC CTGTCTTCCT CTCCTTCACT ATCTTGCTCA CTTGCTGCCT 180 
GAGGTCTCCA TCACCCCAAT CTGTAAGTCA GTTCCAGCTG TAGACCTTCT TCTGGACTTC 240 
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TTGGACATCC ATGTGCATCC ATGGACTTCT TGTGCATCAT TTGGGGCACA TACATACATA 300 

CAGGCAAAAC ACCCATATAT GTAAAATAAT TTTTTTTTAA GATTAGCACC TTAGTTTACT 360 

ttttgacagt tttttgcttg tttgcttggt TAGTTGGTTG GTTTGGCTTT TTGAGATAAC 420 

GTTTCTCTGT GTAGCTCTGG CTGTCCTGAA ATTCACACTA TGTAGACCAG GCTAGCTTTG 480 

AACCCACAGA GATCTGGCTG CCTCTGCTGG GATTAAAGCA TAGGCCACCA CGATTGGCTT 540 

TGAGAAGAAA TAAACTAATA AAGCAAAtAA AGGCCAATTT TATGTCACTC TATTTTATAA 600 

CGATAAACAT GACTTAGTTA TGAGGGAACT GAGACAGGAC ACTTCTAAGA CACCTGTGTT 660 

TCTGACAACA GCCTTACCTC AGAGCAGCAT GTAAAATATT TTAAAAAGGG AAAAAGCCAG 720 

GTGTGGTAAT GCTCACCTGT AATCCTAGCC CTCTGGAAGT TAAGGCAGGA AAATCATGAG 780 

20 TCTAGACTAC AAAAAGAAAA AATAAAGGTC AGCTCCACAT ACGAGATCAA CAACCTCCAC 840 

CACCTAAACT TGGTCCTATG GCTCAGGGGC TGGGGCGGGC TGG AATAGTT AACTGCTGAA 900 

AAAACGGGAT CCAGTAACTT TTCCCTTTAA GCTCTTTTTT TATACCCTGT GTTTTGATCG 960 

TGAGCATTGC CCATATGGGA AGATAGTGAf TATTTTAGCG TCCAGTTCCC AACGGCCCCC 102O 

TTGGAAAGCC AAGGTCTTGC TCCAGCGCCT TCTCAGAGTG TGPPTTGGGG TGTTCCTTTT 1080 

TCTCCTTCCT GCCTTTCAGG CCCCAGAGTG TTGTTTTACT GCTCGCCCTG GCTGTGTCCT 1140 

CAGGACTGGA ACCACATCTT GGAAAAAGTC CCAGWGGTT TGAAGCCAAG AGCCTGGTAG 1200 

TTGTTGATTT TGACTAGAGA TGATGCGGTT TATTTATTTG TTTTCTGCCT CATTTCACCA 1260 

AGCCAACTGA AGGCTAGGGC GTGAAGGCCA TTCCCTTATG TGTTCTAGGG ATGCTCGCAA 1320 

AGCCCCAGCA ATGTCCTTCT TAGCAACTTC ATAGAAATGT CCTATACCTT CACAGCAATG 1380 

TCCTTCTACA CCCTGTTTCT TTGGATTGCA GATGACTAGA GCCAACATCA AGGACCTTTA 1440 

GAGGAAAAAG GACTTTCATG ATCCAAGTAG GGGACAAAGC CTGACAAGGG CCTAGGAGTT 1500 

CGAGCTGGTG AC - . -V 1512 

45 

Claims 

so 1. An isolated DNA sequence comprising all or part of the 5' flanking region of a desmin gene located between 
approximately nucleotides -10000 to -1000 relative to the transcription initiation site of said desmin gene or being 
hybridizable under stringent conditions to said 5' flanking region. 

2. An isolated DNA sequence according to claim 1 , wherein said 5* flanking region is located between nucleotides - 
55 5000 to -1 000 relative to the transcription initiation site of said desmin gene. 

3. An isolated DNA sequence according to claim 1 or 2, wherein said desmin gene is from human, canine, avian, 
bovine, murine, ovine, feline, porcine or simian origin. 
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4. An isolated DNA sequence according to claim 3, wherein said 5* flanking region is located between approximately 
nucleotides -4006 to -2495 relative to the transcription initiation site of the mouse desmin gene. 

5. An isolated DNA sequence according claim 4, wherein said DNA sequence comprises all or part of the sequence 
5 shown in SEQ ID NO: 1 or a sequence being hybridizable under stringent conditions. 

6. An isolated DNA sequence according to claim 3, wherein said 5' flanking region is located between approximately 
nucleotides -4006 to +603 relative to the transcription initiation site of the mouse desmin gene. 

io 7. An expression cassette comprising a gene of interest placed under the control of the elements necessary to its 
expression in an eukaryotic host cell, said elements comprising at least an isolated DNA sequence according to 
claims 1 to 6. 

8. An expression cassette according to claim 7, wherein said elements further comprise a promoter, preferably a min- 
15 imal promoter, operativeiy linked to said isolated DNA sequence. 

9. An expression cassette according to claim 8, wherein said minimal promoter is derived from a desmin gene and 
comprised in the 5' flanking region immediately upstream of the transcription initiation site of said desmin gene. 

20 10. An expression cassette according to claim 9, wherein said minimal promoter is comprised between approximately 
nucleotide -87 to -1 of the 5* flanking region of the mouse desmin gene. 

11 . An expression cassette according to any of claims 7 to 1 0, wherein said elements further comprise at least an exon, 
preferably a non coding exon, and optionally an intron. 

25 

12. An expression cassette according to claim 11, wherein said exon is non coding and extends to approximately 
nucleotide +79 of the mouse desmin gene or comprises coding sequence and extends to approximately nucleotide 
+603 of the mouse desmin gene. . 

30 13. An expression cassette according to any of claims 7 to 12, wherein said gene of interest encodes an antisense 
RNA, a ribozyme or a polypeptide. 

14. An expression cassette according to claim 13, wherein said polypeptide is selected from the group consisting of a 
cytokine, a chemokine, a membrane or nuclear receptor, a ligand, a clotting factor, a clotting inhibitor, the CFTR 

35 protein, insulin, dystrophin, a growth factor, an enzyme, an enzyme inhibitor, an apoplosis inducer, an apoptosis 
inhibitor, a cytostatic agent, an apoilpoprotein, an oxygen radical scaveyer, a cell-cycle control polypeptide, an anti- 
proliferative polypeptide, an angiogenesis inhibitor, an immunogen, a antiviral polypeptide, an antibody, an antibody 
fragment, a toxin, an immunotoxin, a product of a suicide gene, a muscle contraction inhibiting polypeptide and a 
marker. 

40 

1 5. A vector comprising an expression cassette according to any of claims 7 to 14. 

16. A vector according to claim 15, wherein said vector is selected among plasmid, synthetic and viral vectors. 

45 17. A vector according to claim 16, wherein said vector is an adenoviral vector. 

18. A vector according to claim 17, wherein said adenoviral vector is replication-defective by deletion of at least all or 
part of the E1 region. 

so 1 9. A viral particle comprising an expression cassette according to any of claims 7 to 14 or a vector according to any 
of claims 15 to 18. 

20. An eukaryotic host cell, wherein said cell contains an expression cassette according to any of claims 7 to 14 or a 
vector according to any of claims 15 to 18 or is infected by a viral particle according to claim 19. 

55 

21. An eukaryotic host cell according to claim 20, wherein said cell is a muscle cell, preferably a smooth muscle cell. 

22. An eukaryotic host cell according to claim 21, wherein said cell is a vascular smooth muscle cell. 
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23. An eukaryotic host cell according to claim 22, wherein said cell is a arterial smooth muscle cell. 

24. A composition comprising an expression cassette according to any of claims 7 to 14, a vector according to any of 
claims 15 to 18, a viral particle according to claim 19 or an eukaryotic cell according to any of claims 20 to 23. 

25. A composition according to claim 24, further comprising a substance facilitating transection or stability, such as a 
gel of a complex of poly-lysine and lactose or polyoxamer 407. 

26. A pharmaceutical composition comprising an expression cassette according to any of claims 7 to 14 p a vector 
according to any of claims 15 to 18, a viral particle according to claim 19 or an eukaryotic cell according to any of 
claims 20 to 23 and a pharmaceutical^ acceptable vehicle. 



27. Use of an expression cassette according to any of claims 7 to 14, a vector according to any of claims 15 to 18, a 
viral particle according to claim 19 or an eukaryotic cell according to any of claims 20 to 23 to the manufacture of 

15 a drug intended for the treatment or prevention of a cardiovascular disease. 

28. Use according to claim 27, wherein the cardiovascular disease is selected among the group consisting of resteno- 
sis, ischemia, intimal hyperplasia, atherosclerosis, hypercholesterolemia and smooth muscle cell proliferation. 



20 



29. Transgenic animal comprising an expression cassette according to any of claims 7 to 14 or a vector according to 
any of claims 1 5 to 1 8. 



30. Use of a DNA sequence comprising all or part of a sequence located between approximately nucleotides -10000 
to +600 of a desmin gene and preferable -4006 to +603 of the mouse desmin gene, to target expression of a gene 

25 of interest operatively linked to said DNA sequence in a muscle cell selected among the group consisting of car- 
diac, skeletal and smooth muscle cell. 

31. Use of a DNA sequence comprising all or part of a sequence located between approximately nucleotides -5000 to 
-2400 of a desmin gene and preferably -4006 to -2495 of the mouse desmin gene, to target expression of a gene 

30 of interest operatively linked to said DNA sequence and to a minimal promoter in an arterial smooth muscle cell. 



35 



40 



45 
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- Figure 1 - 



Mouse -3168 ACCACCTAAACTTGGT . , C CTATCCCT <>GGG GCTOG GGCGGG CTG GAATA GTTAACT GCTGAAAAAACGGGA TCCAG . . TAaCTTTTCCCTTTAaGCTC -30?3 

|| |fj !|M [JH tl II 1U I HIM II M IT HI fill U I l lllll l JMI U I 

Haman -Z93S ACAACCAAA^CCTGGTTGTCTCTGMGCACrcrrTTGGGCTCCCTGCG TGTCAAAGGGGGAAGGAGACTTACTTTTCATTTTATGC. C -2550 

Mouse -3073 TTTTmATACCaCTCTmc.ATCGTCAfiCATTCCCUTATCCCAAWTACT -2977 

inn i t i ii inn mi ii i urn in n un i mi in i n n 11 n t n 

Human -2849 TTTTrGCACATCTTGAATTTTGTACCAT ATACACTAC^ CATCCGAGTGGTCCTCTCGG r AGCCT -27S7. 



Mouse 
'Human 
Mouse 
Human 

MOUSC 

Human 
Mouic 
Human 
Mouse 
Ha man 




-2785 G, 



-2552 



-2691 § 



-2684- 
-2658 

" f% HP 

^^^^ , 

lift i I it mi nr mi ii 

^kCCrCTAGAGTCCCCrrCACGAACATTAAT CTf.GTTCTTT -2379 

-2604 GGATTGC A GATGA CT AGAGCCAA CATGiA GGAC^^^^^^M^^ ^ ^A CATGATCCAACT ACG G G ACAAA GCCTCA CAA GGGC C TAGGA G -2509 

ii it i iii i ii in i in in in i nit i j i mi. J i ni i mi I I I 

-2378 KGCTgTMCTGATCAAAG.CAATACCAAATGCCr^ • iAGTG -2283 

Mouse -2588 TTCGAGCTGGTGACCACACCCAG, . . .GCCAACGACCGAAGGAA GCCTCTTCCCCAGCTTTCCATGTGTAGCCAGAGCAAGTT -243C 

1 Mil M I III M » M* II 1 Mill ' MM M 11111 I f M I MM 

Human -Z282 TGGGAGC. ^.TCAACTCACAMGTTGAGGGAAGCACTGCAGGMCCMGGGGCTG • ■ .CTTACATATT -2193 

Moose -2429 CACCCCAC CCCACCCCAAACAGAAGTCAGAACA . . . GGCGACCACAGG. ACCAGTTTAGCAAACCCCAATTrATTCA . . .AGATC -2342 

,ui ii m ii iii i in ii .in i in n i mum i fi in ( i milium u it 
Human -2189 gacctctctttcttcctactcccccagggggcaggam -2090 

Mouse -2341 TCAGCTTGACCCTGCTCAGAACC. . .CACaCCCCTTGAGCCAATTCTTACAGTCAGGAAGATGC AATGGCAGGGATCTGAGTCATCTATCT -2254 

|| f 111 Mil MM I J I II HUM III I I liriMI MMii 1 (Ml M Ml I Ml 11 
Human -2089 TCTGATTGG£CCTACTCATTCCCAGKCCATTCOT m ml99B 



-2253 ACTCGTTGAG, 
1 II III 



. CTT CAAGAAGTGAACTCTGT CaTCTGAGA GAAGGACAAGGGCCAGAA . . 
1 11 1 1 lilllH 1 1 I 1 l; MM 



W^^CACAA^TCAAGG* 

SS-vw^ww *"iTt r 1 1 ■ u ' 




2*76 

2*76 
1798 
1980 
-1698 
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■ Figure I - 
(Cont.:) 



CArO-l 



Mouse -1437 TCGCACAACTGGCCACTCACACCCGTCCTCTTCCCACCAC 

i unit inn nil ii 111 n i iiihi 

Human -1542 TTGGAGAATTGGGG, 



GATCAG7 oCATATAAGG >TTATTCTTATTT<nTGCTCTATGC 



I till] 



GTCATGGGTGTGAACTGCTCAGCAGCTT GGCCCACACTAGCTTGCTCAGl pCATflTAAGGf ATTA 



1111 



13SS 
Hill! | If ! 
TTGTTGCTACATAC -14S1 



Mouse -1354 TTTAT7GTCTCCCTGGCCTGTTCAAACOTCATTGCACAGTCATGCGACATGGTTT GGC CCCCACTCTATGACT A -1280 

in i ii inn ii m i n i n * mm rn n mi in n 

Human -J450 ATT AGT AGGGCCTGGGCCTCTTTAAA ECTTT ATa59 *TAG CATGGCAACGCTMCGltCCrCACTTTATATCTCACAAGCTGGGGC'l CAGAG -13S» 

CAiGbox ■ C/EBP Bicold 
Mouse -1279 AGGAC. . . CCTGAGAGGTACCCAGGACAAGGACATGCCTfTTGCCACCGCn'. ... . . TCTCTCTGACCAGCCCTEGGTAGGAtTA CCCCTfOAA -1153 

inn nun t i finitiiii iii f I M J i i ii i i i himnl i iiiilini 

Human -135S AGGACGTKCTGAGCTGCGGCrCAGACMGGACACACCTACTAGTMCCCCTO -0259 
Mouse -1132 TC3i\GGGCTCCATrTCCAAGGfTGftGCTATG6ACCCCT AATCT AGTACrCTATCTCCAGGGTTTGTGTrCTACCAGGGTTCAGGGCTCAGTCAAGACCTG -1693 

ii ii in mi ti i nt i I i mi ti in i ii n mi ii i 

Human "1258 TX^CCACTCTATTTTCAGGTTTG CATTGGAAAAGGCTCAGTCCAAGAGGGACCTCGACAGCCAAGTCCACGrGT A -U83 



Mouse 
Human 
Mouse 

HUTTMrt 

Mouse 
Human 
Mouse 
Human 
Mouse 
Human 



1692 CAGCAAGG TGCACAGAGAAATCTAGATGCCCTTAGGG . TCGGTGTGCGCCrrGTGCTTCACGG . ..... . AAGAGAGAGGCGAACACACCCCAG 

Mil II || Kill II 1111 1 1 1 1 ! I it (1 I 111 Willi II 111 III I I I I 

•11B2 GAGCACGGCCAGTACCCATGGAGMTGGTt^TGTCCrrAGGGGnAGCAAjGrGCCGT GTGCT AAGGAGGGGCCTTT CGAGCTTCCCAGGCCCTCTGTGG 

PEA3 

•1006 GCC7TCATCTCTCTT0. , . .GGGGTCTGGA GTGACTGAAGCTTCTCGCTGTC AGCf 

[ Ii III t III t till 11111 CArCboX til I Ml llllll H! 



-13&2 CGCTCCATTTTTCrGCCGgrCGGGCCTGGA 



-935 CCTCAGGH 
II 1 1 1 1 E 1 
-952 CCTCAGGT 



-S35 



iGCrGGCAAtnUATTGGGGCrCTCACCCTAGCLrrcCTTTlTCrGAC 



TTO 

nun 



. GACACCTCTCTCT 
(III II II 




-S92 



-737 



-792 



&CGX£X\ 



•m7 



aw 



936 



836 



738 



-647 



Mouse -646 TCGTCAGA . AGATTCTG GAAACT ATCTTGCTGG CTATAAACTTGA - .GGGAAGCAGAACGCCAACATTCCTCCA.AGGGA -571 

i 1 1 1 1 ■ uiiniii i i in iiriiintii mi i hi iiii n i mi nm 

Human -693 TTATCAGACCCTTTCTGGAMTCAGCCCACnTn^ -S3* 

Mouse -570 AACTGAGGCTCAGAGTT AAAACCCAGGTAKAGTCATATGCATG . , TCCCCCGGCCAOGGTCACT . CTCTGACTAACCGGTACCTACCCTACAGGCCTAC -474 

L t 1 1 11 i 1 1 1 1 M || II Ii ii ii mi n fllllll II II I I E I I I III Hit I 111 t 

Human -593 AACTGAGGCTC A GGGCCAGCTCGC CCATAGA CATACAT(iC^GCnTGG<:CAGGATC CCTCCCCCT GCCAGGCT CTC CCTGCCCTCC CTTCCTCC -499 

Moiiie. -473 CTAGAGACTCTTTT<^GGAT(^A^ -374 

Mliilll | I I III I II (Mil I I : IMIIII I III It 1 1 11 III II II 

Human CTAGAGACCC CCACCCTCAAGCCTGGCTGCTCTXrG..CCrGAGACCCAMCCTC^ -4C9 



Mouxe 

Human 

Mouse 

Human 

Mouse 

Human 

Mouse 

Human 

Mouse 

Human 

Mouse 

Human 



-373 GGGGGTCCTT CTTCTTCATT. . . CCMTAACCTAAMC£TCTCCTCGGjAGAAMTA^G5GCCTCAAAC . AAACGAAATTCTCT AG 

Ml llll IIII I n m mi m II I f 1 1 1 I Mil Ml tlllllll Hi I II 1(11 

-40ft AGGG. . CCnTCCTGGGMCAG<i£CTTGACCCTreAA 

CArGbox 

-291 CCCGCTTT CCCCAGGATM&GCAGGCATCbUATCGAAAAAAAGWGCCGGCCGGKSGT CTCCTGTCA GCTC 

i hi i iimm in it i i:u it mi tin n im 11 i i ir m 

-310 TCrGCTCTCTCCCCrGAC^CACTCCCCCCCACCCCAGCACGAGG.AGA TAACCA. . .CGGCTAAAGA6GCC. GCTCGGGGCTGCAGA CATGCTTGCTG 

ccuGCAGaCTKaCTcn-aGTcacnGGf^"^"' — ... 



Z92 



•311 



221 



-220 CTTGCCCTGTGAAA 

i timn n 

-215 CCTKCCr<XCGaAGGATTGGTAGGCrTGCC 



-123 




-US 



+70 



+75 
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■ Figure 2 



CArGti 



Oct 



-4006 G CtoATTfmTnfAjgaAA ll GAGCCATTC 

EcoRF G^faT" 
-3956 ATCCTGTACT CAACCATCCC 



Spl 

CCAGCAC ICCT AGCCCCft CCA 



TAGAGGTTTT AGAAAA KCTT 

TTTCGAATC CTGTCTTCCT 

GAGGTCTCCA TCACCCCAAT 

-3806 CTGTAAGTCA GTTCCAGCTG TAGACCTTCT TCTGGACTTC TTGGACATCC 

-3756 ATGTGCATCC ATGGACTTG" TGTGCATCAT TTGGGGCACA TACATACATA 
Kfa?* CArg4 ' 

-3706 CAGGCAAAAC A dcCATATAfr GW AATAAT 



-3906 TATACtfc CAA CCCCCAAaA AATGTGCTAC 

-3856 ctccttc 4ctatc8 tgctca CTTGCTGCCT 



-3656 TTAGTTTACT TTTTGACAGT TTTTTGCTTG TTTGCTTGGT TAGTTGGTTG 
-3606 CTTTGGCTTT TT&GATAAK! GTTTCTCTGT GTAGCTCTGG CTGTCCTGAA 



TTTTTTTAA GATTAGCACC 



-3556 ATTCACACTA TGTAGACCAG GCTAGCTTTG AACCCACAGA GATCTGGCTG 

NF-1 

-3506 CCTCTGCTGG GATTAAAGCA TAGGCCACCA C flATTGGCTT T TGAGAAGAAA 

CAHS3 ^ NVxXS 
-3456 TAAAGTAATA AA dCAAATAA AGCfc fcAATTf TAtGTl^C TtfTTTT ATAA 



-3406 CGATAAACAT GACTTAGTTA TGAGGGAACT GAGACAGGAC ACTTCTAAGA 

C/EBP : 

-3356 CACCTGTGTT TCTGACAACA G dCTTACCTC Al GAGCAOeAT GTAAAAT^E 

YY1 C/EBP 
-3306 TTAAAAAGd G AAAAAGCCAG flTGTGGTAAT1 GCTCACCTGT AATCCTAGCC 

-3256 CTCTGGAAGT TAAGGCAGGA AAATCATGAG Tto^CTAC AAAAAGAAAA 

-3206 AATAAAGGTC AGCTCCACAT ACGAGATCAA CAACCTCCAC CACCTAAACT 

SpI Spl ; 

-3156 TGGTCCTATG GCTC AlGGGGC TlGGflGCGGd c -TGGAATAG7T AACTGCTGAA 



-3106 AAAACGGGAT CCACTAACTT TTCCCTTTAA GCTCTTTTTT TATACCCTGT 
-3056 GTTTTGATCG TGAGCATTGC CCATATGGGA lAfiAIAfijTGAT TATTTTAGCG 



-3006 TCCAGTTCCC AACGGCCCCC TTGGAAAGCC AAGGTQTg£JLC£fll^GCCT 

-2956 TCTCAGAGTG TGTTTTGGGG TGTTCCTTTT TCTCCTTCCT GCCTTTCAGG 

AM 



NF-1 



TGTGTCCT CAGGAGTGGA 



-2906 CCCCAGAGTG TTGTTTTACT GC 

-2856 ACCACATOT GGAAAAAGTC CCAGrEfigPl TGAAftrf AAb AGCCTCGTAC 
AP-1 

-2606 TTGTTGATTT EfiACS&GA TGATGCGGTT TATTTATTTG TTTTCTGCCT 
-2756 CATTTCACCA AGCCAACTGA AGGCTAGGGC GTGAAGGCCA^ TTCCO TATG 
-2706 TGTTCTAGGG ATGCTCGCAA AGCCCCAGCA ATGTjCQICL TAfiTAiACTTC 
-2656 ATAGAAATGT CCTATACCTT CACAGCAATG TCCTTCTACA CCCTGTTTCT 
-2606 TTGGATTGCA G^LGACIA^ GCCAACATCA AGGAlCU1AJj&G0AAAAAG 
-2556 GAGTTTCATG ATCCAAGTAG GGGACAAAGC CTGACAAGGG CCTAGGAGTT 

-2506 cGAGcrdnnLAr; -2495 

BftEUbtont 
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